A constant-factor approximation algorithm for the k-median problem
نویسندگان
چکیده
We present the first constant-factor approximation algorithm for the metric k-median problem. The k-median problem is one of the most well-studied clustering problems, i.e., those problems in which the aim is to partition a given set of points into clusters so that the points within a cluster are relatively close with respect to some measure. For the metric k-median problem, we are given n points in a metric space. We select k of these to be cluster centers, and then assign each point to its closest selected center. If point j is assigned to a center i, the cost incurred is proportional to the distance between i and j. The goal is to select the k centers that minimize the sum of the assignment costs. We give a 6 2 3 -approximation algorithm for this problem. This improves upon the best previously known result of O(log k log log k), which was obtained by refining and derandomizing a randomized O(log n log logn)approximation algorithm of Bartal. We also give constant factor approximation algorithms for several natural extensions of the problem. [email protected]. Stanford University, Stanford, CA 94305. Research supported by the Pierre and Christine Lamond Fellowship, NSF Grant IIS-9811904 and NSF Award CCR9357849, with matching funds from IBM, Mitsubishi, Schlumberger Foundation, Shell Foundation, and Xerox Corporation. [email protected]. Stanford University, Stanford, CA 94305. Research Supported by IBM Cooperative Fellowship, NSF Grant IIS-9811904 and NSF Award CCR-9357849, with matching funds from IBM, Mitsubishi, Schlumberger Foundation, Shell Foundation, and Xerox Corporation. [email protected]. Cornell University, Ithaca, NY 14853. Research partially supported by NSF grants CCR-9700163 ONR grants N00014-98-1-0589 and N00014-96-1-0050. [email protected]. Cornell University, Ithaca, NY 14853. Research partially supported by NSF grants CCR-9700029 and DMS-9805602 and ONR grant N00014-96-1-00500.
منابع مشابه
Constant Approximation for Capacitated k-Median with (1 + ε)-Capacity Violation
We study the Capacitated k-Median problem for which existing constant-factor approximation algorithms are all pseudo-approximations that violate either the capacities or the upper bound k on the number of open facilities. Using the natural LP relaxation for the problem, one can only hope to get the violation factor down to 2. Li [SODA’16] introduced a novel LP to go beyond the limit of 2 and ga...
متن کاملConstant Approximation for Capacitated k-Median with (1+epsilon)-Capacity Violation
We study the Capacitated k-Median problem for which existing constant-factor approximation algorithms are all pseudo-approximations that violate either the capacities or the upper bound k on the number of open facilities. Using the natural LP relaxation for the problem, one can only hope to get the violation factor down to 2. Li [SODA’16] introduced a novel LP to go beyond the limit of 2 and ga...
متن کاملConstant-Factor Approximation for Ordered k-Median
We study the Ordered k-Median problem, in which the solution is evaluated by first sorting the client connection costs and then multiplying them with a predefined non-increasing weight vector (higher connection costs are taken with larger weights). Since the 1990s, this problem has been studied extensively in the discrete optimization and operations research communities and has emerged as a fra...
متن کاملApproximating k-median with non-uniform capacities
In this paper we give a constant factor approximation algorithm for the capacitated k-median problem. Our algorithm produces a solution where capacities are exceeded by at most a constant factor, while the number of open facilities is at most k. This problem resisted attempts to apply the plethora of methods designed for the uncapacitated case. Our algorithm is based on adding some new ingredie...
متن کاملAn approximation algorithm for Uniform Capacitated k-Median problem with 1 + ε capacity violation
We study the Capacitated k-Median problem, for which all the known constant factor approximation algorithms violate either the number of facilities or the capacities. While the standard LP-relaxation can only be used for algorithms violating one of the two by a factor of at least two, Shi Li [SODA’15, SODA’16] gave algorithms violating the number of facilities by a factor of 1 + exploring prope...
متن کاملInapproximability of the Asymmetric Facility Location and k-Median Problems
In the asymmetric versions of the uncapacitated facility location and k-median problems, distances satisfy the triangle inequality but the distances from point i to point j and from j to i may differ. For the facility location problem there is an O(logN) approximation algorithm due to Hochbaum. For the k-median problem, Lin and Vitter gave a bicriterion approximation algorithm that blows up the...
متن کامل